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Abstract. This note contains two remarks. The first remark concerns the extension of 
the well-known Cayley representation of rotation matrices by skew symmetric matrices to 
rotation matrices admitting —1 as an eigenvalue and then to all orthogonal matrices. We 
review a method due to Hermann Weyl and another method involving multiplication by 
a diagonal matrix whose entries are +1 or —1. The second remark has to do with ways 
of flipping the signs of the entries of a diagonal matrix, C, with nonzero diagonal entries, 
obtaining a new matrix, E, so that E+A is invertible, where A is any given matrix (invertible 
or not). 
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1 The Cayley Representation of Orthogonal Matrices 

Given any rotation matrix, R G SO(n), if R does not admit —1 as an eigenvalue, then there 
is a unique skew symmetric matrix, S", {S'^ = —S) so that 

R= (I - s){I+ S)-\ 

This is a classical result of Cayley (1846) and R is called the Cayley transform of S. 
Among other sources, a proof can be found in Hermann Weyl's beautiful book The Classical 
Groups [7 , Chapter II, Section 10, Theorem 2.10.B (page 57). 

As we can see, this representation misses rotation matrices admitting the eigenvalue — 1, 
and of course, as det((J — S){I + S)~^) = +1, it misses improper orthogonal matrices, i.e., 
those matrices i? G 0(n) with det(i?) = — 1. 

Question 1. Is there a way to extend the Cayley representation to all rotation matrices 
(matrices in SO(n))? 

Question 2. Is there a way to extend the Cayley representation to all orthogonal matrices 
(matrices in 0(n))? 

Answer: Yes in both cases! 

An answer to Question 1 is given in Weyl's book Chapter II, Section 10, Lemma 
2.10.D (page 60): 

Proposition 1.1 (Weyl) Every rotation matrix, R G SO(n), can be expressed as a product 

R={I- S^)iI + S^)-\I - 52)(/ + 82)-', 
where Si and S2 are skew symmetric matrices. 

Thus, if we allow two Cayley representation matrices, we can capture orthogonal matrices 
having an even number of —1 as eigenvalues. Actually, proposition 11.11 can be sharpened 
slightly as follows: 

Proposition 1.2 Every rotation matrix, R G SO{n), can be expressed as 

R = [{I - s){i + sr^y 

where S is a skew symmetric matrix. 

Proposition 11.21 can be easily proved using the following well-known normal form for 
orthogonal matrices: 
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Proposition 1.3 For every orthogonal matrix, R G 0{n), there is an orthogonal matrix P 
and a block diagonal matrix D such that R = PD P^ , where D is of the form 



D 



\ 



\ 



... D, 



such that each block Di is either 1, —1, or a two-dimensional matrix of the form 
where < 9^ < n. 



cos 6i — sin 6i 
sin 9i cos 9i 



In particular, if i? is a rotation matrix [R G SO{n)), then it has an even number of 
eigenvalues —1. So, they can be grouped into two-dimensional rotation matrices of the form 

-1 0\ 

-l) ' 

i.e., we allow 6i = tt and we may assume that D does not contain one- dimensional blocks of 
the form —1. 

A proof of Proposition II. 31 can be found in Gantmacher Chapter IX, Section 13 (page 
285), or Berger j2j, or Gallier Chapter 11, Section 11.4 (Theorem 11.4.5). 

Now, for every two-dimensional rotation matrix 



T 



with < 6* < vr, observe that 



T2 



cos 9 — sin 6 
sin 6 cos 6 



cos(e/2) -sin(^/2)\ 
sin(^/2) cos(e/2) J 



does not admit —1 as an eigenvalue (since < 9/2 < tt/2) and T = (^^^ ■ Thus, if we 
form the matrix R^ by replacing each two-dimensional block in the above normal form 
by Dl , we obtain a rotation matrix that does not admit —1 as an eigenvalue, R = [R^ 



and the Cayley transform of i?2 is well defined. Therefore, we have proved Proposition 11.21 



□ 



Next, why is the answer to Question 2 also yes? 
This is because 
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Proposition 1.4 For any orthogonal matrix, R G 0{n), there is some diagonal matrix, E, 
whose entries are +1 or —1, and some skew- symmetric matrix, S, so that 

R = E{I - S){I + S)-\ 

As such matrices E are orthogonal, all matrices E{I — S){I + S)^^ are orthogonal, so we 
have a Cayley-style representation of all orthogonal matrices. 

I am not sure when Proposition ll.4l was discovered and originally published. Since I could 
not locate this result in Weyl's book [7j, I assume that it was not known before 1946, but 
I did stumble on it as an exercise in Richard Bellman's classic PP, first published in 1960, 
Chapter 6, Section 4, Exercise 11, page 91-92 (see also. Exercises, 7, 8, 9, and 10). 

Why does this work? 

Fact E: Because, for every n x n matrix, A (invertible or not), there some diagonal matrix, 
E, whose entries are +1 or —1, so that / + EA is invertible! 

This is Exercise 10 in Bellman (Chapter 6, Section 4, page 91). Using Fact E, it is 
easy to prove Proposition 11.41 

Proof of Proposition \1.4\ Let R G 0(n) be any orthogonal matrix. By Fact E, we can find 
a diagonal matrix, E (with diagonal entries ±1), so that I + ER is invertible. But then, as 
E is orthogonal, ER is an orthogonal matrix that does not admit the eigenvalue —1 and so, 
by the Cayley representation theorem, there is a skew symmetric matrix, S, so that 

ER= {I - S){I + S)-\ 

However, notice that E"^ = I, so we get 

R = E{I - S){I + S)-\ 

as claimed. □ 

But Why does Fact E hold? 

As we just observed, E"^ = I, so by multiplying by E, 

I + EA is invertible iS E + A is. 

Thus, we are naturally led to the following problem: If A is any nxn matrix, is there a way 
to perturb the diagonal entries of A, i.e., to add some diagonal matrix, C = diag(ci, . . . , c„), 
to A so that C + A becomes invertible? 

Indeed this can be done, and we will show in the next section that what matters is not 
the magnitude of the perturbation but the signs of the entries being added. 
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2 Perturbing the Diagonal of a Matrix to Make it In- 
vertible 



In this section we prove the following result: 

Proposition 2.1 For every n x n matrix (invertible or not), A, and every any diagonal 
matrix, C = diag(ci, . . . ,Cn), with q 7^ for i = 1,. . . ,n, there an assignment of signs, 
ei = ±1, so that if E = diag(eiCi, . . . , e„c„), then E + A is invertible. 

Proof . Let us evaluate the determinant oiC+A. We see that A = det(C+^) is a polynomial 
of degree n in the variables ci, . . . , c„ and that all the monomials of A consist of products of 
distinct variables (i.e., every variable occurring in a monomial has degree 1). In particular, 
A contains the monomial Ci ■ ■ - Cn- In order to prove Proposition 12.11 it will suffice to prove 

Proposition 2.2 Given any polyomial, P{xi, . . . ,Xn), of degree n (in the indeterminates 
Xi, . . . ,Xn and over any integral domain of characteristic unequal to 2), if every monomial 
in P is a product of distinct variables, for every n-tuple (ci, . . . , c„) such that Q 7^ for 
i = 1, . . . ,n, then there is an assignment of signs, = ±1, so that 

P(eiCi, . . . ,e„c„) 7^ 0. 

Clearly, any assignment of signs given by Proposition 12.21 will make det{E + A) 7^ 0, 
proving Proposition 12.11 □ 

It remains to prove Proposition 12.21 

Proof of Proposition WM We proceed by induction on n (starting with n = 1). For n = 1, the 
polynomial P{xi) is of the form P{xi) = a + 6x1, with 5 7^ since deg(P) = 1. Obviously, 
for any c 7^ 0, either a + 6c 7^ or a — 6c 7^ (otherwise, 26c = 0, contradicting 6 7^ 0, c 7^ 
and the ring being an integral domain of characteristic 7^ 2). 

Assume the induction hypothesis holds for any n > 1 and let P( Xi, . . . , ^n+i) be a 
polynomial of degree n + 1 satisfying the conditions of Proposition 12.21 Then, P must be of 
the form 

P{X\^ . . . , Xfi^i Xn+l) ^(^^Ij • • • ; Xri) ~\~ ^(Xi, . . . , Xn^Xn+l^ 

where both Q(xi, . . . , x„) and S{xi, . . . polynomials in xi, . . . , x„ and S'(xi, . . . , x^) 

is of degree n and all monomials in S are products of distinct variables. By the induction 
hypothesis, we can find (ei, . . . , e„), with ej = ±1, so that 

S'(eiCi, . . . ,e„c„) 7^ 0. 

But now, we are back to the case n = 1 with the polynomial 

(5(eiCi, . . . , e„c„) + S'(eiCi, . . . , enC„)x„+i, 
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and we can find e„+i = ±1 so that 

P(eiCi, . . . , enCn, e„+ic„+i) = Q{eiCi, . . . , e„c„) + S{eiCi, . . . , e„c„)e„+ic„+i 7^ 0, 

establishing the induction hypothesis. □ 

Note that in Proposition 12 .If the q can be made arbitrarily small or large, as long as they 
are not zero. Thus, we see as a corollary that any matrix can be made invertible by a very 
small perturbation of its diagonal elements. What matters is the signs that are assigned to 
the perturbation. 

Another nice proof of Fact E is given in a short note by William Kahan 0. Due to its 
elegance, we feel compelled to sketch Kahan's proof. This proof uses two facts: 

(1) If A = {Ai, . . . , A„_i, U) and B = {Ai, . . . , An-i, V) are two n x n matrices that differ 
in their last column, then 

det{A + B)= 2"-i(det(A) + det(S)). 

This is because determinants are multilinear (alternating) maps of their columns. 
Therefore, if det(A) = det(-B) = 0, then det(y4 + S) = 0. Obviously, this fact also 
holds whenever A and B differ by just one column (not just the last one). 

(2) For every k = 0, ... 2*^ — 1, if we write k in binary as k = fc„ • ■ ■ fci, then let Ek be 
the diagonal matrix whose ith diagonal entry is —1 iff ki = 1, else +1 iff ki = 0. For 
example, Eq = I and -E'2"-i = —I- Observe that E/^ and -E^+i differ by exactly one 
column. Then, it is easy to see that 

Eq + El + ■ ■ ■ + £'2"-! = 0. 

The proof proceeds by contradiction. Assume that det(/ + E^A) = 0, 
for /c = 0, . . . , 2" — 1. The crux of the proof is that 

det(J + EoA + I + EiA + I + E2A + ■■■ + I + ^2--!^) = 0. 

However, as Eq + Ei + ■ ■ ■ + -E2"-i = 0, we see that 

I + E0A + I + EiA + I + E2A+--- + I + E^r^^iA = 2"/, 

and so, 

= det(/ + EqA + 1 + EiA + J + E2A + ■•■ + / + ^2^.1^) = det(2"J) = 2" 0, 
a contradiction! 

To prove that det(J + EqA + I + EiA + I + E2A H h / + ^2"-iA) = 0, we observe 

using fact (2) that, 

det(J + E2iA + / + E2i+iA) = det(/ + E2iA) + det(/ + E2i+iA) = 0, 
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for i = 0, . . . , 2" ^ — 1; similarly, 

det(/ + E^iA + 1 + E^i+iA + J + ^4i+2^ + / + ^4i+3^) = 0, 
for 2 = 0, ... , 2""^ — 1; by induction, we get 

det(/ + EoA + I + EiA + I + E2A + ■ ■ ■ + I + £^2"-i^) = 0, 
which concludes the proof. 
Final Questions: 

(1) When was Fact E first stated and by whom (similarly for Proposition ll.4|l ? 

(2) Can Proposition 12. II be generalized to non-diagonal matrices (in an interesting way)? 
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